VEZHNEVETS AND FERRARI: LOOKING OUT OF THE WINDOW 1 Object localization in ImageNet by looking out of the window

نویسندگان

Alexander Vezhnevets

Vittorio Ferrari

چکیده

We propose a method for annotating the location of objects in ImageNet. Traditionally, this is cast as an image window classification problem, where each window is considered independently and scored based on its appearance alone. Instead, we propose a method which scores each candidate window in the context of all other windows in the image, taking into account their similarity in appearance space as well as their spatial relations in the image plane. We devise a fast and exact procedure to optimize our scoring function over all candidate windows in an image, and we learn its parameters using structured output regression. We demonstrate on 92000 images from ImageNet that this significantly improves localization over recent techniques that score windows in isolation [32, 35].

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Object localization in ImageNet by looking out of the window

Figure 1: Connecting the appearance and window position spaces. A window tight on the baseball (green star in the appearance space plot) and some larger windows containing it (red circles in the appearance space). Black points in appearance space represent all other candidate windows. The appearance space plots are actual datapoints, representing windows in 3-dimensional Associative Embedding o...

متن کامل

Looking out of the window: object localization by joint analysis of all windows in the image

Traditionally, object localization is cast as an image window classification problem, where each window is considered independently and scored based on its appearance alone. Instead, we propose a method which scores each candidate window in the context of all other windows in the image, taking into account their similarity in appearance space as well as their spatial relations in the image plan...

متن کامل

Context Forest for efficient object detection with large mixture models

We present Context Forest (ConF) — a technique for predicting properties of the objects in an image based on its global appearance. Compared to standard nearestneighbour techniques, ConF is more accurate, fast and memory efficient. We train ConF to predict which aspects of an object class are likely to appear in a given image (e.g. which viewpoint). This enables to speed-up multicomponent objec...

متن کامل

Context Forest for Object Class Detection

Global image appearance carries information about properties of objects in the image. For instance, a picture of a highway taken from a car is more likely to contain cars from the back viewpoint than from the side (fig. 1). This shows how the global image appearance of images can help understanding what objects are present and what they look like. Moreover, another property that can be inferred...

متن کامل

Semantic Segmentation Using Multiple Graphs with Block-Diagonal Constraints

In this paper we propose a novel method for image semantic segmentation using multiple graphs. The multiview affinity graph is constructed by leveraging the consistency between semantic space and multiple visual spaces. With block-diagonal constraints, we enforce the affinity matrix to be sparse such that the pairwise potential for dissimilar superpixels is close to zero. By a divide-and-conque...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2015

VEZHNEVETS AND FERRARI: LOOKING OUT OF THE WINDOW 1 Object localization in ImageNet by looking out of the window

نویسندگان

چکیده

منابع مشابه

Object localization in ImageNet by looking out of the window

Looking out of the window: object localization by joint analysis of all windows in the image

Context Forest for efficient object detection with large mixture models

Context Forest for Object Class Detection

Semantic Segmentation Using Multiple Graphs with Block-Diagonal Constraints

عنوان ژورنال:

اشتراک گذاری